[Spyre-Next] [Cleanup] Rework of CustomOp wrapping by bohnstingl · Pull Request #842 · torch-spyre/sendnn-inference

bohnstingl · 2026-03-17T12:13:26Z

Description

This PR simplifies how CustomOps are wrapped for execution on Spyre. Beyond the simplification, the rework enables more features from upstream vLLM, for example the compilation interface. This will eventually be needed when larger parts of the model are executed on Spyre.

Related Issues

#733

Test Plan

This change should be transparent to users, so the existing tests should continue to pass without modification.

Checklist

I have read the contributing guidelines
My code follows the project's code style (run bash format.sh)
I have added tests for my changes (if applicable)
I have updated the documentation (if applicable)
My commits include a Signed-off-by: line (DCO compliance)

Signed-off-by: Thomas Ortner <boh@zurich.ibm.com>


github-actions · 2026-03-17T13:08:43Z

👋 Hi! Thank you for contributing to vLLM support on Spyre.
Just a reminder: Make sure that your code passes all the linting checks, otherwise your PR won't be able to be merged. To do so, run ./format.sh.
Now you are good to go 🚀.

We also recommend installing prek and configuring it to check your code before every local commit.

joerunde · 2026-03-17T20:00:02Z

bot:next-test

joerunde · 2026-03-17T20:03:16Z

+    return prefix
+
+
+def create_rmsnorm_op_pair():


What's the rationale for moving op-specific methods to a generic utilities package? I would expect this to continue living in rms_norm.py, e.g. as rms_norm.create_op_pair().

bohnstingl · 2026-03-18

Yeah, I was debating this myself. The issue is that, although the two functions are very similar, upstream vLLM runs the infer_schema function, which requires explicit knowledge of the inputs, and those inputs are specific to the CustomOp at hand. I left the _fake_impl function in utils.py but moved the op-specific function back to the individual files.
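To illustrate why schema inference pushes the op-specific function back into the per-op files, here is a minimal standard-library sketch. It is an analogy only: `infer_toy_schema`, `generic_impl`, and `rms_norm_like` are hypothetical names invented for this example, and the code does not reproduce the infer_schema function that upstream vLLM runs. It only shows that a catch-all `*args` signature carries no per-argument information, while an explicitly typed signature does.

```python
import inspect


def generic_impl(*args, **kwargs):
    """Hypothetical catch-all wrapper: no per-argument names or types."""
    raise NotImplementedError


def rms_norm_like(x: float, weight: float, eps: float) -> float:
    """Hypothetical op-specific function with explicit, typed inputs.

    Scalars are used instead of tensors to keep the sketch dependency-free.
    """
    return x * weight / (abs(x) + eps)


def infer_toy_schema(fn) -> str:
    """Toy schema inference: build a schema string from type annotations.

    Rejects variadic and unannotated parameters, because nothing can be
    inferred about them from the signature alone.
    """
    sig = inspect.signature(fn)
    parts = []
    for name, param in sig.parameters.items():
        if param.kind in (param.VAR_POSITIONAL, param.VAR_KEYWORD):
            raise ValueError(f"cannot infer a schema for variadic parameter '{name}'")
        if param.annotation is inspect.Parameter.empty:
            raise ValueError(f"parameter '{name}' has no type annotation")
        parts.append(f"{param.annotation.__name__} {name}")
    if sig.return_annotation is inspect.Signature.empty:
        raise ValueError("missing return annotation")
    return f"({', '.join(parts)}) -> {sig.return_annotation.__name__}"


schema = infer_toy_schema(rms_norm_like)

try:
    infer_toy_schema(generic_impl)
    generic_ok = True
except ValueError:
    generic_ok = False
```

In this sketch the typed function yields a usable schema while the generic wrapper is rejected, which mirrors the constraint described above: the shared helper can stay generic, but the function whose signature is inspected has to spell out its inputs per op.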

joerunde · 2026-03-17T23:24:49Z

^^ looks like tests are failing, mind taking a look into it?

Signed-off-by: Thomas Ortner <boh@zurich.ibm.com>

bohnstingl · 2026-03-18T23:50:03Z

As a general comment, the CustomOp infrastructure will be overhauled once vLLM IR lands upstream. I looked at it, and I think we are in a good position for a quick turnover when that happens. In fact, the refactor in this PR should simplify that change even further.

Signed-off-by: Joe Runde <joe@joerun.de>

joerunde · 2026-03-19T21:52:51Z

        return pytree.tree_map(
-            lambda el: el[:orig_batch_size, :],
-            convert_from_spyre(outs, dtype=x_dtype, device=x_device),
+            lambda el: convert(el, dtype=x_dtype, device=x_device)[:orig_batch_size, :],


I think there was a small bug here: convert wasn't inside the transform, so when there is a residual, a tuple was passed to convert. This started failing with the new unit tests.

bohnstingl · 2026-03-19

Looks good to me! Thank you @joerunde
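The conversion bug discussed in this thread can be reproduced with a small dependency-free sketch. Everything here is a stand-in invented for illustration: `FakeTensor`, `convert`, `tree_map`, and `slice_rows` are hypothetical substitutes for the real tensor type, the conversion helper, `pytree.tree_map`, and the `[:orig_batch_size, :]` slice, and the sketch assumes the conversion helper accepts only a single tensor.

```python
class FakeTensor:
    """Hypothetical stand-in for a 2-D tensor living on some device."""

    def __init__(self, rows, device="spyre"):
        self.rows = [list(r) for r in rows]
        self.device = device


def convert(t, device):
    """Stand-in conversion helper; assumed to accept one tensor, not a tuple."""
    if not isinstance(t, FakeTensor):
        raise TypeError(f"convert expects a single tensor, got {type(t).__name__}")
    return FakeTensor(t.rows, device=device)


def tree_map(fn, tree):
    """Tiny stand-in for pytree.tree_map over nested tuples and lists."""
    if isinstance(tree, (tuple, list)):
        return type(tree)(tree_map(fn, el) for el in tree)
    return fn(tree)


def slice_rows(t, n):
    """Stand-in for el[:n, :], dropping the padded batch rows."""
    return FakeTensor(t.rows[:n], device=t.device)


orig_batch_size = 2
padded = FakeTensor([[1.0, 2.0], [3.0, 4.0], [0.0, 0.0], [0.0, 0.0]])
outs = (padded, FakeTensor(padded.rows))  # (output, residual)

# Old pattern: convert the whole structure first, then map the slice.
# With a residual, convert receives a tuple and raises.
try:
    tree_map(lambda el: slice_rows(el, orig_batch_size), convert(outs, device="cpu"))
    old_pattern_failed = False
except TypeError:
    old_pattern_failed = True

# Fixed pattern: convert inside the mapped function, one leaf at a time.
fixed = tree_map(
    lambda el: slice_rows(convert(el, device="cpu"), orig_batch_size), outs
)

# The fixed pattern also handles the no-residual case (a single tensor).
single = tree_map(
    lambda el: slice_rows(convert(el, device="cpu"), orig_batch_size), padded
)
```

Moving the conversion into the mapped function means it only ever sees leaves, so the same code path works whether the op returns one tensor or an (output, residual) tuple.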

joerunde · 2026-03-19T21:53:00Z

bot:next-test

joerunde

Nice cleanup!

I've merged in main and 🤞 the tests should be passing and we'll be good to merge

joerunde · 2026-03-19T22:08:33Z

Tests are passing!

@bohnstingl feel free to merge if you're fine with the tiny bugfix I added on the output conversion in rms_norm.py

bohnstingl added 7 commits March 10, 2026 11:48

Added SiluAndMul wrapper

7720517

Signed-off-by: Thomas Ortner <boh@zurich.ibm.com>

Move tensor slicing on CPU to be functional

32f64b7

Signed-off-by: Thomas Ortner <boh@zurich.ibm.com>

Lint issues

b13c9e1

Signed-off-by: Thomas Ortner <boh@zurich.ibm.com>

Updated code documentation

1ea4b03

Signed-off-by: Thomas Ortner <boh@zurich.ibm.com>

Merge branch 'main' of github.com:vllm-project/vllm-spyre into siluan…

414243b

…dmul

Merge branch 'main' of github.com:vllm-project/vllm-spyre into wrappe…

aaefdd4

…r_rework

Reworked and simplified CustomOp wrapper for spyre

3f6aa41

Signed-off-by: Thomas Ortner <boh@zurich.ibm.com>

bohnstingl requested review from joerunde and prashantgupta24 as code owners March 17, 2026 12:13

bohnstingl self-assigned this Mar 17, 2026

bohnstingl mentioned this pull request Mar 17, 2026

Evaluate whether vLLM IR can be used to wrap layers for torch-spyre #733

Open

github-actions Bot changed the title ~~[Cleanup] Rework of CustomOp wrapping~~ [Spyre-Next] [Cleanup] Rework of CustomOp wrapping Mar 17, 2026

joerunde reviewed Mar 17, 2026


bohnstingl added 3 commits March 18, 2026 07:25

removed op specific function from utils.py

8c35de1

Signed-off-by: Thomas Ortner <boh@zurich.ibm.com>

Fixed comment in forward_native

37ac38a

Signed-off-by: Thomas Ortner <boh@zurich.ibm.com>

Simplified CustomOps interface

e1649d2

Signed-off-by: Thomas Ortner <boh@zurich.ibm.com>

bohnstingl requested a review from joerunde March 19, 2026 11:12

joerunde added 2 commits March 19, 2026 15:22

Merge branch 'main' into wrapper_rework

e3ff5d9

🐛 fixup small conversion bug

29adf01

Signed-off-by: Joe Runde <joe@joerun.de>

joerunde reviewed Mar 19, 2026


joerunde approved these changes Mar 19, 2026


bohnstingl merged commit 16a88b7 into torch-spyre:main Mar 19, 2026
14 checks passed

This was referenced Mar 19, 2026

Wrap MLP layer for torch-spyre #736

Closed

[Spyre-Next] Wrapped Embedding layer for spyre #836

Merged

dilipgb mentioned this pull request Mar 31, 2026

[Spyre-Next] [Feature] Wrap RoPE layer on Spyre #881

Draft

2 tasks

bohnstingl merged 12 commits into torch-spyre:main from bohnstingl:wrapper_rework
